Modeling of audiovisual speech perception in noise

نویسندگان

Tobias S. Andersen

Kaisa Tiippana

Jouko Lampinen

Mikko Sams

چکیده

We present three models of audiovisual speech perception at varying signal-to-noise ratios (SNR). The first model is Massaro’s Fuzzy Logical Model of Perception (FLMP) applied at each SNR. The second model imposes the constraint that the visual response probabilities are the same regardless of the SNR. Both models describe the data well. Root Mean Squared Error (RMSE) corrected for the numbers of degrees of freedom was smaller for the latter model. In concordance, cross validated paired t-test showed that the latter model was significantly better at predicting individual performance despite the lower number of parameters. In a third model – a weighted FLMP – the SNR is parameterized reducing the number of free parameters substantially. This model fits the data significantly worse than the other two models, but does capture salient features of the change in performance with varying SNR.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reliability of Interaural Time Difference-Based Localization Training in Elderly Individuals with Speech-in-Noise Perception Disorder

Background: Previous studies have shown that interaural-time-difference (ITD) training can improve localization ability. Surprisingly little is, however, known about localization training vis-à-vis speech perception in noise based on interaural time difference in the envelope (ITD ENV). We sought to investigate the reliability of an ITD ENV-based training program in speech-in-noise perception a...

متن کامل

Effect of signal to noise ratio on the speech perception ability of older adults

Background: Speech perception ability depends on auditory and extra-auditory elements. The signal-to-noise ratio (SNR) is an extra-auditory element that has an effect on the ability to normally follow speech and maintain a conversation. Speech in noise perception difficulty is a common complaint of the elderly. In this study, the importance of SNR magnitude as an extra-auditory effect on speech...

متن کامل

Envelope-based inter-aural time difference localization training to improve speech-in-noise perception in the elderly

Background: Many elderly individuals complain of difficulty in understanding speech in noise despite having normal hearing thresholds. According to previous studies, auditory training leads to improvement in speech-in-noise perception, but these studies did not consider the etiology, so their results cannot be generalized. The present study aimed at investigating the effectiveness of envelope-b...

متن کامل

Audiovisual processing of Lombard speech

Perception results are presented that address the role of Lombard speech in auditory and audiovisual speech perception. Basically, visual enhancement neutralizes the advantage of Lombard speech observed for auditory perception. It remains an open question whether or not Lombard speech is preferable for perception studies of speech in noise.

متن کامل

Time is of the essence in speech perception! Get it fast, or think about it

Speech recognition occurs when attending to speech stimuli in auditory, visual, or audiovisual modalities under optimum (e.g., in silence) or degraded listening conditions (i.e., in background noise or in individuals with hearing impairment). The present thesis contains details of the first study to show how background noise (steady-state white noise) delayed the identification of different typ...

متن کامل

Audiovisual Lombard speech: reconciling production and perception

An earlier study compared audiovisual perception of speech ’produced in environmental noise’ (Lombard speech) and speech ’produced in quiet’ with the same environmental noise added. The results and showed that listeners make differential use of the visual information depending on the recording condition, but gave no indication of how or why this might be so. A possible confound in that study wa...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2001

Modeling of audiovisual speech perception in noise

نویسندگان

چکیده

منابع مشابه

Reliability of Interaural Time Difference-Based Localization Training in Elderly Individuals with Speech-in-Noise Perception Disorder

Effect of signal to noise ratio on the speech perception ability of older adults

Envelope-based inter-aural time difference localization training to improve speech-in-noise perception in the elderly

Audiovisual processing of Lombard speech

Time is of the essence in speech perception! Get it fast, or think about it

Audiovisual Lombard speech: reconciling production and perception

عنوان ژورنال:

اشتراک گذاری